Data Modeling with Postgres
Table Creation
| Criteria | Meet Specification |
|---|---|
|
Table creation script runs without errors. |
The script, |
|
Fact and dimensional tables for a star schema are properly defined. |
CREATE statements in |
ETL
| Criteria | Meet Specification |
|---|---|
|
ETL script runs without errors. |
The script, Since this is a subset of the much larger dataset, the solution dataset will only have 1 row with values for value containing ID for both |
|
ETL script properly processes transformations in Python. |
INSERT statements are correctly written for each table, and handle existing records where appropriate. |
Code Quality
| Criteria | Meet Specification |
|---|---|
|
The project shows proper use of documentation. |
The README file includes a summary of the project, how to run the Python scripts, and an explanation of the files in the repository. Comments are used effectively and each function has a docstring. |
|
The project code is clean and modular. |
Scripts have an intuitive, easy-to-follow structure with code separated into logical functions. Naming for variables and functions follows the PEP8 style guidelines. |
Tips to make your project standout:
- Insert data using the COPY command to bulk insert log files instead of using INSERT on one row at a time
- Add data quality checks
- Create a dashboard for analytic queries on your new database